List of Flash News about CBRN filtering classifier
Time | Details |
---|---|
2025-08-22 16:19 |
AnthropicAI: Classifier Cuts CBRN Accuracy by 33% Beyond Random Baseline With No Benign Task Impact | AI Safety Update
According to @AnthropicAI, a classifier setup reduced CBRN accuracy by 33% beyond a random baseline; source: @AnthropicAI. The source also reports no particular effect on a range of other benign tasks, addressing concerns that filtering CBRN data would harm harmless scientific capabilities; source: @AnthropicAI. |